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TITLE OF THE INVENTION 

APPARATUS FOR RECORDING OR REPRODUCING MULTIMEDIA DATA USING 
HIERARCHICAL INFORMATION STRUCTURE AND INFORMATION STORAGE MEDIUM 

THEREOF 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims priority of U.S. Provisional Application number 60/452,546, 
filed March 7, 2003 and claims priority of Korean Patent Application No. 2002-54945 filed on 
September 11, 2002, in the Korean Intellectual Property Office, the disclosure of which is 
incorporated herein in its entirety by reference. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0002] The present invention relates to recording and reproducing multimedia data, and more 
particularly, to apparatuses to record and/or reproduce multimedia data using a hierarchical 
information structure and an information storage medium thereof. 

2. Description of the Related Art 

[0003] A multimedia data recording/reproducing apparatus requires additional information, 
such as attributes of multimedia information or a sequence of data reproduction, to record the 
multimedia information on or to reproduce the multimedia information from a data storage 
medium. FIG. 1 illustrates the conventional multimedia data recording/reproducing apparatus 
200 including the data storage medium 100 and connected to a user output device 300. The 
multimedia data recording/reproducing apparatus 200 may be controlled using a user input 
device 400, such as a remote control. In general, in order to record the additional information, 
additional information tables are formed at a particular position or in a particular file of the 
existing data storage medium. Examples of the data storage medium include compact discs 
(CDs), video CDs, and digital versatile discs (DVDs). In the additional information table, 
information is described by a location and a length of a data field. Thus, a new information table 
needs to be created whenever a new type of multimedia information medium is designed. In 
particular, navigation information, to select a reproduction unit or to determine the reproduction 
sequence, is recorded in table formats on the DVD. However, such navigation information is 
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mixed with the information that defines reproduction units, thereby making it difficult to carry out 
a navigation. 

[0004J The data structure of a conventional DVD will now be described. Referring to FIG. 2, 
a DVD includes a video manager (VMG) area and a plurality of video title set (VTS) areas. 
Control information and title selection information are stored in the VMG area, where the title 
information of a reproduction unit is stored in the plurality of VTS areas. The VMG area 
includes two or three files and each VTS area includes three to twelve files. 

[0005] The VMG area is illustrated in FIG. 3 in detail. The VMG area includes a video 
manager information (VMGi) area io store the additional information regarding the VMG area, a 
video object set (VOBS) area to store video object information of a menu, and a VMGI backup 
area. Each of the above areas includes a single file. The VOBS area may or may not be 
included in the VMG area, but the other two areas, the VMGI area and the VMGI backup area, 
are required. 

[0006] Title information and VOBS are stored in the VTS area. A plurality of titles may be 
recorded in the VTS area. The VTS area is illustrated in detail in FIG. 4. 

[0007] Referring to FIG. 4, video title set information (VTSI), the VOBS for menu, which is a 
video object set for the VTS menu, VOBS for title, which is the video object set for the titles in a 
VTS, and VTSI backup data are recorded in VTS area #n. However, the VOBS for the title may 
not be included in VTS area #n. Each VOBS is divided into a video object (VOB) on cells, which 
are recording units. Each VOB includes the cells. According to an aspect of the present 
invention, the cell is determined to be a lowest-ranked unit of data. 

[0008] A reproduction unit is represented by a hierarchical structure in which a title is present 
at a corresponding top level. The title includes one program chain (PGC) or a plurality of PGCS 
linked to one another. Here, a first PGC is called an entry PGC. FIG. 5 illustrates the 
reproduction unit one_sequential_PGCJitle, including only a single PGC (an entry PGC). FIG. 
6 illustrates the reproduction unit in which the title is linked to the plurality of PGCs. After 
reproduction of the PGC, another PGC is reproduced by selecting from at least one of several 
PGCs. In this case, a selection command may be stored in program chain information (PGCI). 
Controlling the sequence of PGC reproduction is called the navigation. FIG. 7 illustrates the 
structure of the PGC. The PGC is stored in an information structure described as a PGCI 
format. The PGCI includes a pre-command in which navigation commands are stored, a post- 
command, and a plurality of program information units. The pre-command is carried out before 
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the reproduction of a related PGC and the post-command is carried out after the reproduction of 
the PGC. Each program information unit includes a plurality of cell information units, each cell 
linked to the cell in the VOB, which is the recording unit. Each cell included in each 
reproduction unit has a cell command that is carried out after reproduction of the cell. 
Therefore, the PGCI represents a hierarchical reproducing structure of the PGC, i.e., the 
reproduction unit, in which the lowest-ranked reproduction unit cell is linked to the lowest-ranked 
record unit cell. 

[0009] FIG. 8 illustrates a case of branching a new PGC using command navigation 
information during or after reproduction of the PGC. In other words, it is possible to easily 
control the sequence of the PGC reproduction using navigation commands such as 
LinkPrevPGC, LinkTopPGC, LinkNextPGC, LinkTailPGC, LinkGoUpPGC, and LinkPGCN. That 
is, the PGC has reproduction units and also navigation information. 

[0010] A program in the PGC is referenced by a link called a part of title (PTT). 

[001 1] In the case of the DVD, the above information is stored in a binary table format, that 
is, the information is recorded in table formats where the information is recorded within a 
predetermined bit length on a particular position of the table. 

[0012] FIG. 9 illustrates a data structure of a TT_SRPT information table, which is title 
information in the VMGI. Referring to FIG. 9, the leading two bytes of the TT_SRPT information 
indicates a total number of titles n. The next two bytes of the TT_SRPT information is reserved 
for extension information defined in a future standard. The other bytes following the two bytes 
represent TT_SRP information that individually describes the titles. In particular, a VTN number, 
designated by a related title, and a title number in a related VTS are recorded with 
predetermined bit lengths on certain positions of VTSN and VTS_JTN, respectively. 

[0013] FIG. 10 illustrates a data structure of VTS_PTT_SRPT information in the VTSI. 
Referring to FIG. 10, the VTS_PTT_SRPT information includes TTU_SRPs corresponding to a 
number of the titles of the related VTS. The respective TTU_SRPs include information to 
designate one of the PTT__SRPs following the TTU_SRPs. Therefore, a PTT_SRP between a 
PTT_SRP, which is designated by a TTU_SRP, and a PTT_SRP, which is designated by a next 
TTU_SRP, form the title. That is, a plurality of Part_of_Title Search Pointers PTT-SRPs are 
present in each title, resulting in a division of the title into several PTTs. The PTT may be 
viewed as a chapter, for easier understanding. FIG. 11 illustrates contents of the PTT_SRP, 
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designating the PGC and a program in the PGC. In conclusion, the title is divided into several 
PTTs and each PTT is linked to a program in the PGC. 

[0014] FIG. 12 illustrates a data structure of a PGCIT VTS_PGCIT, representing a PGCI 
table in the VTSI. Referring to FIG. 12, a total number of the programs and the cells that belong 
to the PGC are stored in a VTS_PGCITI. Also, the VTS_PGCIT stores VTS_PGCIs as many as 
a plurality of VTS_PGCs, which belong to the VTS. FIG. 13 illustrates a detailed data structure 
of the VTS_PGCI. In the VTS_PGCI, various information are recorded within particular bit 
lengths at particular positions in table formats, including a PGC_CMDT, which describes a pre- 
command, a post-command, a cell command, and so on. Also, the VTS_PGCI includes a 
PGC_PGMAP, which indicates program start cells as many as the programs, and a C_POSIT, 
which is the information for linkage of respective cells to respective record units. 

[0015] As described above, in the case of the conventional DVD, which is a multimedia data 
storage medium, the video object data and the data regarding the reproduction units and the 
navigation are recorded as the titles and the PGCs. However, the additional data in the PGCs is 
stored in the table format within particular lengths at a certain position. The commands to 
navigate are also stored in a limited space, together with the additional data, which define 
reproduction units. Therefore, the advent of a new reproduction unit results in a change of the 
table location; thus, making it difficult to implement an extension for the new reproduction. To 
solve this problem, reserved spaces are formed in a plurality of regions, which still limits any 
extension for the future. In the worst-case scenario, a table structure may be redefined. In fact, 
existing multimedia data storage media such as the CDs, the VCDs, the MDs, and the DVDs 
have table structures of their own. 

[0016] Meanwhile, advancements in Internet technologies trigger development of personal 
computers (PCs) capable of processing multimedia. For instance, video object data or audio 
data may be inserted into a document with HyperText Markup Language (HTML) or may be 
reproduced during reproduction of HTML. In this case, the navigation data is described with a 
script language or the like. Therefore, the navigation data may be described separate from 
reproduction data. If there are two types of navigation data, e.g., one controlled using script 
language and another one described in the table format, it is complicated to control both types 
of navigation data. 
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SUMMARY OF THE INVENTION 

[0017] According to an aspect of the present invention, there is provided a data storage 
medium and a data reproduction apparatus. Assuming that the data reproduction apparatus 
operates as specified in the present invention, a recording apparatus records information, 
according to the data structure of which is specified in the present invention, and multimedia 
data on a storage medium. Here, storing operations are understood as identical to recording 
operations. 

[0018] Multimedia data and additional data are recorded in a storage medium, according to 

on oerk^ot r\i the* nrocont inwon+inn Tho nrocont in\/pntinn qi innpstQ spnpratp rprmrdinn of twn 

Ul I UO^OUl Wl VI IV/ W ■ WWWI I * M ■ » Wl • li W w f www. . «. w — — — — w w p. . - Cf ~ ' ~ ' 

different types of additional data: additional information regarding record units, attributes, and 
reproduction units of the multimedia data, and navigation information regarding selection of a 
reproduction unit and a reproduction sequence. The additional information regarding a record 
unit, attributes, or a reproduction unit, with respect to multimedia data, is described using a 
markup language. Accordingly, implementation supporting an extension of a future standard is 
possible even when adding a new type of multimedia data or prescribing a new type of 
recording or reproduction unit, regardless of the standard adopted. Alternatively, the additional 
information may be stored in a binary table format. 

[0019] Also, both or one of a markup language and a script language may be used to 
describe navigation data, which represents selection of a reproduction unit or reproduction 
sequence. A markup language is also used to describe presentation data, which represents a 
menu screen to select a reproduction unit and a screen layout for data reproduction, thereby 
enabling a menu structure and navigation with a high degree of flexibility. 

[0020] According to an aspect of the present invention, there is provided a multimedia data 
storage medium in which multimedia data is stored. The multimedia data storage medium 
includes a first layer in which the multimedia data, such as video object images, still images, 
voice, graphics, and texts, is stored; and a second layer in which, when the multimedia data is 
divided into the recording unit and the reproduction unit, information regarding attributes of the 
record unit and relationship between the record unit and the reproduction unit are described with 
the markup language using elements and the attributes. 

[0021] The navigation data, which is used to control a selection of the reproduction unit and 
the reproduction sequence, may be recorded on a third layer using the markup language or the 
script language, in addition to the information recorded on the first and second layers. 
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[0022] Additional aspects and/or advantages of the invention will be set forth in part in the 
description which follows and, in part, will be obvious from the description, or may be learned by 
practice of the invention , 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0023] The above and/or other aspects and advantages of the present invention will become 
more apparent by describing in detail a preferred embodiment thereof with reference to the 
attached drawings in which: 

FIG. 1 illustrates a conventional multimedia data recording/reproducing apparatus; 
FIG. 2 illustrates a data structure of a conventional DVD; 
FIG. 3 illustrates a VMG area; 
FIG. 4 illustrates a VTS area; 

FIG. 5 illustrates a reproduction unit one_sequential_PGCJitle, including only a single 

PGC; 

FIG. 6 illustrates the reproduction unit in which a title is linked to a plurality of PGCs; 
FIG. 7 illustrates a structure of the PGC; 

FIG. 8 illustrates a case of branching a new PGC using command navigation information 
during or after reproduction of the PGC; 

FIG. 9 illustrates a data structure of a TT__SRPT information table; 

FIG. 10 illustrates a data structure of VTS_PTT_SRPT information in a VTSI; 

FIG. 11 illustrates contents of a PTT_SRP, designating the PGC and a program in the 

PGC; 

FIG. 12 illustrates a data structure of a PGCIT VTS_PGCIT, representing a PGCI table in 
the VTSI; 

FIG. 13 illustrates a detailed data structure of a VTS_PGCI; 
FIG. 14 illustrates a filel.mpg and a file2.mpg, in accordance with an aspect of the 
present invention; 

FIG. 15 illustrates two video object clips, in accordance with an aspect of the present 
invention; 

FIG. 16 illustrates a position of video object data at a time gap position recorded in a 
table format, in accordance with an aspect of the present invention; 

FIG. 17 illustrates a video object file, in accordance with an aspect of the present 
invention; 
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FIG. 18 illustrates the reproducing apparatus, in accordance with an aspect of the 
present invention; and 

FIG. 19 illustrates a method of forming a menu screen for navigation, in accordance with 
an aspect of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0024] Reference will now be made in detail to the present aspects of the present invention, 
examples of which are illustrated in the accompanying drawings, wherein like reference 
numerals refer to the like elements throughout. The aspects are described below in order to 
explain the present invention by referring to the figures. [Preferred embodiments of the present 
invention will be described with respect to a storage medium on which a video object title is 
recorded, being supported by a plurality of voice types and subtitles.] 

[0025] According to an aspect of the present invention, a storage medium on which a video 
object title is recorded, being supported by a plurality of voice types and subtitles. Additional 
information may be hierarchically recorded and a markup language is used to implement each 
layer for extensibility. The markup language, which describes record units and reproduction 
units, is called a media description language. According to an aspect of the present invention, 
the record unit and the reproduction unit are functionally divided as individual units. 

[0026] Referring to FIG. 14, filel .mpg, which is a first half of data representing a video object 
title, and file2.mpg, which is the other half, are recorded in a first layer. A title may be divided 
into a plurality of files given a size limit of a chapter unit or a file. Video object data is 
compressively encoded to reduce a data amount thereof. MPEG, which is one of the most 
popular motion picture compression methods, supports a variable bit rate (VBR) encoding 
method in which a bit rate per hour varies according to an amount of video information. A time 
search determining a starting point of the data reproduction with consideration to a given time 
requires table-type information to map the time and a position of the data. The information is 
used to detect the position of desired data since a predetermined time after a start of the data 
reproduction. In general, the table-type information includes information regarding data 
positions measured at every predetermined point of time. For instance, the table-type 
information may be time map information that represents temporal position linking information 
indicating the data positions measured with respect to the file beginning every 10 seconds. 
However, because the amount of time map information is significant, the information is recorded 
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in a binary table format, rather than using the markup language, and stored in the first layer. 
FIG. 14 illustrates the time map information file1timemap.dat and file2timemap.dat, regarding 
the video object data filel .mpg and file2.mpg, respectively. According to an aspect of the 
present invention, actual time map information is illustrated in FIG. 16. 

[0027] Additional data, regarding multimedia data recorded on the first layer, is recorded on a 
second layer. The additional data defines a reproduction unit to appropriately reproduce the 
multimedia data on the first layer. Here, the reproduction unit may be divided into record units 
or units of storage, which are described in a binary table format, or in an alternative aspect of 
the present invention, using the markup language, and stored as a description. xml file. 
Navigation information, which is to be added to the additional data, is recorded on a third layer 
as a menu.xml file. The stored navigation information determines a selection and sequences of 
the data reproduction by controlling the reproduction unit recorded on the second layer. 

[0028] In general, a menu screen is organized in a recording medium, on which a plurality of 
titles or chapters are recorded, to allow random access of a particular title or chapter in a user 
selection and immediate reproduction at a specified position. In the menu screen, still images 
and buttons are generally formed. Also, background music may be reproduced. When a user 
selects a button, a function associated with the button is executed. Referring to FIG. 14, the still 
images and music data included in the menu screen are recorded as file3.jpg and file4.mp3 files 
on the first layer. 

[0029] The additional data recorded on the second layer describes the information regarding 
the data recorded on the first layer. Video object data is multiplexed in a data stream to 
synchronize video, audio, and graphic data. The attributes of the record units of the video 
object data are described as description.xml, using the attributes of the video, audio, and 
graphic data, and numbers of audio, video, and graphic data. Further, the additional data 
provides information regarding reproduction units that are generated by a combination or a 
selection of the record units. 

[0030] In an aspect of the present invention, a 'clip' is used as the record unit and 'cells', 
'chapters', and 'titles' are used as the reproduction units. The following description provides 
explanations on these units (see FIG. 15): clip: The clip is an object described in relation to the 
recording of the multimedia data. FIG. 15 illustrates two video object clips. The video object 
clip has information about the time and the position. The data belonging to the clip can be 
continuously reproduced. That is, an mpg file including the video object data and a time map 
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file including the temporal position information are combined to form the clip. The time map file 
includes the additional information that enables a quick search for a desired temporal position of 
the video object data when the video object is VBR encoded. 

[0031] That is, if the video object file is the VBR encoded as shown in FIG. 17, the position of 
the video object data at a time gap position, designated by predetermined time intervals from a 
starting position, is recorded in the table format shown in FIG. 16. If the data position in the 
table is called an entry, a total number of entries and time gaps may be recorded at a beginning 
of the table. A search of a desired position of the data, with respect to a predetermined instant 
of time, can be accomplished by detecting the time gap position most proximate to a 
predetermined instant of time recorded in the table. The desired data may be precisely 
reproduced by reading the data starting from the detected position. If the data is recorded at a 
constant bit rate (CBR), it is possible to maintain the coded data amount generated for a 
predetermined time. In this case, the clip can be constructed with only video object data 
because it is possible to detect the desired position of the data using the time calculations 
according to the CBR without the time map information. 

[0032] The following information is used to define the video object clip including the video 
data, a plurality of audio data groups, and a plurality of graphic data groups: 

(a) general attributes: clip identification information, video object clip files, and clip 
temporal position information file (time map file); 

(b) video: screen size (e.g., 1920'1080, 1280720, and 720'480), average bit rate (e.g., 
4M, 6M, 10M, and 20M), screen output rate (e.g., 60Hz, 30Hz, and 24Hz), scanning types (e.g., 
progressive and interlaced types); 

(c) audio: audio stream identification information, audio encoding information, linguistic 
attributes of each audio data group (e.g., Korean, and English), and application attributes of 
each audio data group (e.g., main audio, sub audio, and commentary); and 

(d) graphic: graphic stream identification information, graphic encoding information, 
linguistic attributes of each graphic data group (e.g., Korean, and English), application attributes 
of each graphic data group (e.g., subtitles, and animation) 

[0033] The above information, however, may be changed according to the recording type, 
encoding type, and multiplexing type of the multimedia data. Also, the record unit may form a 
hierarchical structure, and, thus, the record unit subordinate to the clip may be present. For 
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convenience, according to an aspect of the present invention, the record unit is determined to 
be made of clips. 

[0034] In general, the reproduction unit has the hierarchical structure, that is, the 
reproduction unit includes a plurality of reproduction sub-units. In this case, the reproduction 
sub-unit is defined as a unit of a reproduction sequence or a point of random access. 

[0035] Referring to FIG. 15, a cell is the reproduction unit that is described in relation to the 
reproduction of the multimedia data, each cell designating the clip or a portion of the clip. The 
cell represents a lowest layer of the reproduction unit. That is, the cell, i.e., a reproduction unit, 
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[0036] The cell is defined using the identification information, and the starting and ending 
times of the related clip. However, if the cell designates an entire clip, not a portion thereof, the 
starting and ending times are not additionally specified. 

[0037] Referring to FIG. 15, a chapter is the reproduction unit described in relation to the 
reproduction of the data, each chapter including at least one cell. In general, the chapter is 
defined by a chapter name, chapter identification information, and information regarding at least 
one cell belonging to the chapter. 

[0038] According to an aspect of the present invention, the chapter can be understood as a 
reproduction point that allows the user to perform the random access. Thus, within the title, the 
user can search for or reproduce the desired data in units of chapters. In general, the menu 
screen provides a menu in which the desired data can be selected in the units of chapters. 

[0039] Referring to FIG. 15, a title includes a plurality of chapters. A plurality of titles may be 
stored in a storage medium and can be sequentially reproduced according to a sequence of title 
identification signs. In general, the menu screen provides a menu that allows selection of the 
title, the title defined by a title name, a title identification sign, and information regarding at least 
one chapter belonging to the title. 

[0040] FIG. 15 illustrates a relationship between video object data recorded on the first layer, 
and clips, chapters, and titles recorded on the second layer. 

[0041] According to an aspect of the present invention, the information regarding the second 
layer is described in two types of units, i.e., the record units and the reproduction units, while the 
data regarding the navigation is eliminated. In general, the data is described in a binary table 
format or through the markup language. In an aspect of the present invention, the data is 
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described using the markup language because the markup language is more advantageous 
than the binary table. Extensible markup language (XML), a representative example of the 
markup language, is defined in W3C, which prescribes recommendations for the Internet. With 
XML, it is possible to describe a variety of databases and documents. When information 
regarding the second layer is described with XML, it is very easy to ensure extendibility and 
backward compatibility. 

[0042] Hereinafter, the XML-based language will be defined describing the additional 
information to be stored in the second layer, according to an aspect of the present invention. 
The XML-based language is referred to as media description language (MDL). XML is 
described with a combination of hierarchical elements. Also, each element may have several 
attributes. The name of the element is described within a sign '<>'. The regulations to describe 
a sub element are mentioned below. 

[0043] The MDL, according to an aspect of the present invention, has elements and 
attributes as described below. Here, a document is understood as a unit of data that is 
described and stored utilizing the markup language. 

(a) The uppermost element of the MDL document is described using <mdl> or an equivalent 
value. 

(b) The element <mdl> may have the following sub-elements: 

- <head> 

- <body> 

(c) An element <head> contains all information regarding a storage unit and may have the 
following sub-element: 

- <meta> 

(d) An element <meta> defines a blank element in which features of a document are defined 
and appropriate values are allocated to the features. Each <meta> element denotes a pair 
including an attribute and a value. 

[0044] The attributes of elements defined in the element <meta> are as follows: 

- Name: A name denotes a document feature defined in the element <meta>. The name is an 
attribute indispensable to the element <meta>. 

- Content: A content denotes a feature value defined in the element <meta>. The content is 
also an attribute indispensable to the element <meta>. 
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[0045] The following are examples of the element <meta> excluding conventional examples 
of the element <meta>: e.g., (i) <meta name="type" content="mdl-disc" />, which describes a 
disc that is manufactured using the media description language; and (ii) <meta name- 'region" 
content="1" />, which indicates that the regional code of the disc is 1 . 

[0046] (e) The element <body> may have the following sub-elements: 

- <clip>, and 

- <title> 

[0047] (f) The element <ciip> may have the foiiowing attributes: 

- Id: representing the identification information for each clip. Id varies according to the type of 
storage medium (e.g., 1, 2, or 3), 

- src: representing a video object data file such as 'file1.mpg\ and 

- tmap_src: representing a file containing a time map table (e.g., file1timemap.dat) 

[0048] The element <clip> may have the following sub-elements: 

- <video>, 

- <audio>, and 

- <graphic> 

[0049] (g) The element <video> may have the following attributes: 

- resolution: image sizes in vertical and horizontal directions, e.g., 192C1080, or 1280720, 

- frame_rate: a number of frames output per second, e.g., 60, 30, 24, or 50, 

- scanning: indicating whether an image is sequentially scanned or not, i.e., whether the image 
is progressively scanned or interlaced scanned, 

- bitjrate: average bit rate, e.g., vbr, 4m, 6m, 8m, 10m, or 20m, and 

- streamjd: stream id of an MPEG PES stream, e.g., OxeO 

[0050] (h) The element <audio> may have the following attributes: 

- encoding: representing an encoding method, e.g., mp1, mp2, mp3, ac3, Ipcm, ordts, 

- sampling_rate: sampling rate, e.g., 48k, 96k, or 192k, 

- quantization_bit: a number of quantized bits, e.g., 16, or 24, 

- bit_rate: an encoded bit rate, e.g., vbr, 128k, or 384k, 

- channel_no: a total number of channels, e.g., 2, 5, or 7, 
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- language: linguistic attributes, e.g., none, en, ko, jp, or fr, 

- application: usage of an audio stream, e.g., main, sub, or commentary, 

- streamjd: stream ID of an MPEG PES stream, e.g., OxcO, or Oxbd, and 

- sub_stream_id: sub stream ID of an MPEG PES stream, e.g., none, 0x80, or OxaO. 

[0051] (i) The element <graphic> may have the following attributes: 

- encoding: representing the encoding method, e.g., dvd_subpicture, 

- application: usage of s-graphic stream, e.g., animation or subjitle, 

- language: linguistic attributes, e.g., none, en, ko, jp, or fr, 

_ ch-oam jrj; stream !D on an MPEG PES stream, e.g., OxBD, and 

- sub_streamjd: Sub stream ID on an MPEG PES stream, e.g., 0x20. 

[0052] (k) The element <title> may have the following attributes: 

- name: title name, e.g., White Snow, 

- Id: title identification information that varies according to the type of storage medium, e.g., 1, 
2, or 3. 

[0053] Also, the element <title> may have the following sub elements: 

- <chapter>. 

[0054] (I) The element <chapter> may have the following attributes: 

- name: chapter name, e.g., Dwarf, and 

- Id: chapter identification information that varies according to a title, e.g., 1 , 2, or 3. 
[0055] Also, the element <chapter> may have the following sub-element: 

- <cell>. 

[0056] (m) The element <cell> may have the following attributes: 

- clipjd: identification number of a clip to which the cell is linked, 

- startjime: starting time in a clip clipjd, and 

- end_time: ending time in a clip clipjd. 

[0057] A summary of the above attributes and elements is shown in Table 1 . 
Table 1 
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Element 


Attribute 


Sub Element 


<mdl> 




<head>/<body> 


<head> 




<meta>* 


<meta> 


name, content 


EMPTY 


<body> 




<clip>+/<title>+ 


<clip> 


id, src, tmap_src 


<video>/<audio>*/<graphic>* 


<video> 


resolution, frame_rate, scanning, 
bit_rate, stream_id 


EMPTY 


<audio> 


encoding, sampling_jate, 
quantization_bit, bit_rate, 
channel_no, language, application, 
streamjd, sub_streamjd 


EMPTY 


<graphic> 


encoding, application, language, 
streamid, sub_sireamjd 


tivir i y 


<title> 


name, id 


<chapter>+ 


<chapter> 


name, id 


<cell>+ 


<cell> 


clipjd, start_time, end_time 


EMPTY 



[0058] The additional data of FIG. 15, according to a preferred embodiment of the present 
invention, may be described using the above elements and attributes, as follows: 



<?xml version="1.0"?> 
<!DOCTYPE mdl 
PUBLIC "http://mdl-disc.org/mdl1/mdl.dtd"> 

<mdl> 
<head> 

<meta name="type" content="mdl-disc" /> 
< meta name-'title" content-White Snow Special" /> 
<meta name="region" content="1" /> 
</head> 

<body> 

<clip id="1" src="file1.mpg" tmap_src="file1tmap.dat"> 
<video resolution="1 920x1 080" frame_rate="24" 

scanning="progressive" bit_rate="vbr" stream_id="0xe0" /> 
<audio encoding="ac3" sampling_rate="48k" 
quantization J)it="1 6" bit_rate="384k" channeLno="5" 

language="en" 

application="main" stream_id="0xbd" sub_stream_id="0x80" /> 

<audio encoding="ac3" sampling_rate="48k" quantization_bit="24" 
bit_rate="384k" channeLno="5" language="ko" 

application="main" streamjd="0xbd" 

sub_streamjd="0x81" /> 

<audio encoding="ac3" sampling_rate="48k" quantization_bit="24" 
biLrate="384k" channel_no="5" language="jp" 

application="main" streamjd="0xbd" sub_streamjd="0x84" /> 
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<audio encoding= M mp1" sampling jrate="48k" quantization_bit="16" 
bit_rate="384k" channeLno="2 M language= M en" 

application="commentary" streamjd-'OxcO" 
sub_streamjd="none" /> 

<graphic encoding="dvd_subpicture" application= ,, sub_title" 
language="en" streamjd-'Oxbd" 

sub_streamjd="0x20" /> 

<graphic encoding="dvd_subpicture" application-'subjitle" 
language= H ko" streamjd="Oxbd" 

sub_streamjd= ,, 0x21 n /> 

<graphic encoding="dvd_subpicture" application= M subJitle" 
language="jp M stream_id- 'Oxbd" 

— u — ;j-«nx/00» 

</clip> 

<clip id="5" src="file5.mpg" tmap_src="filek5map.dat"> 
<video resolution^ 920x1 080" frame_rate="24" 

scanning-'progressive" bit_rate="vbr" stream_id="0xe0" /> 
<audio encoding="ac3" sampling_rate="48k" quantization_bit="16" 
bit_rate="384k" channel_no="5" language="en" 



application="main" stream_id="0xbd" sub_streamjd="0x80" /> 

<audio encoding="mp1" sampling_rate="48k" quantization_bit="16" 
bit_rate="384k" channel_no="2" language="en" 

application="commentary" stream_id="0xbd" 
sub_stream_id="0x83" /> 
</clip> 

<title id="1" name="White Snow"> 
<chapter id="1" name="Mirror"> 

<cell clip_id="1" start_time="0:00:00" 

end_time="0:05:00"/> 

</chapter> 

<chapter id="2" name="7 Dwarves"> 

<cell clip_id="1" start_time="0:05:00" end_time="0:20:007> 
<cell clip_id="2" start_time="0:00:00" end_time="1:00:007> 

</chapter> 

</title> 

<title id="2" name="White Snow Supplementary'^ 

<chapter id="1" name="How to make Snow White"> 

<cell clip_id="5" start_time="0:00:00" end_time="0:20:007> 
</chapter> 

</title> 
</body> 
</mdl> 
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[0059] In an aspect of the present invention, actual video object data includes five clips 
represented in the record units, and two titles represented in the reproduction units. 

[0060] In the case of the VBR video object data, the clip is described as time position 
information data, which includes a portion of time map information. Thus, in an aspect 
according to the present invention, each clip is described using the two attributes src and 
tmap_src, such that the clip data and the titles are linked to each other. In addition, each clip 
includes many attributes of video, audio, and graphic data, and is referred to prior to the data 
reproduction. 

[0061] Title 1 is a subject title including a plurality of chapters= The first chapter includes a 
cell linked to a portion of a clip #1 . Thus, only the portion of the clip #1 is reproduced during the 
reproduction of the first chapter. The second chapter includes two cells. The first cell 
designates the reproduction of the data after time 0:05:00 of the clip #1 . Accordingly, in order to 
start the reproduction from the second chapter, the position of desired data must be detected 
using the time map information, specifically, the time position information regarding clip #1. 

[0062] As shown in a corresponding title name attribute, title 2 includes an additional image, 
i.e., a supplementary image. The title 2 is recorded as clip #5. 

[0063] As mentioned above, improved extensibility can be achieved by describing the 
information regarding the second layer using the markup language. In other words, an 
information structure of a new concept can be described using new elements or attributes. 
Then, the existing reproducing apparatuses can reproduce the data using the existing 
information rather than newly generated information. That is, it is possible to maintain the 
reproduction of the data regarding the existing element using a conventional reproducing 
apparatus. Assuming that a new element <bookmark> is added to the <title> element and the 
following information is recorded on the second layer: 

<title id="1"... > 

<bookmark point="1:00:00" /> 

<chapter id="1"..> 

</title> 

[0064] The element <bookmark> is an information structure that is newly defined to allow a 
direct access to a particular position of the title. When the storage medium, on which the 
element <bookmark> is recorded, is inserted into the conventional reproducing apparatus, it is 
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possible to reproduce the data recorded on the storage medium using the title and the chapter 
information. However, the bookmark information is ignored because the bookmark information 
cannot be reproduced using the conventional reproducing apparatus. 

[0065] As mentioned above, the multimedia data can be reproduced using only the data 
recorded on the first and second layers. In this case, the reproducing apparatus reads the 
information recorded on the second layer, and determines the format of recorded data recorded, 
the title, and the number of chapters belonging to each title. Next, the reproducing apparatus 
informs the user of the detection result through an appropriate user interface, receives a user 
input, and reproduces the desired reproduction unit. The user interface includes a user output 
device 300 and a user input device 400 as shown in FIG. 1. In detail, the user output device 
300 is an apparatus, such as a television (TV), that outputs multimedia data, and the user input 
device 400 is an apparatus, such as a remote control, that receives a user input. The user 
interface, according to an aspect of the present invention, includes a chapter menu in a menu 
screen, which allows selection of the chapters belonging to each title. The user selects a title 
number or a chapter number from each menu using the remote control 400, resulting in the 
detection and the reproduction of a desired reproduction unit. 

[0066] The storage medium 100, according to an aspect of the present invention, further 
includes navigation information recorded on a third layer. The navigation, which is similar to a 
conventional menu, includes a selection using the user input device 400, such as the remote 
control and reproduction of the reproduction unit 200 to a user output device, such as a TV. 
Also, the navigation may include the control of the following data reproduction using a current 
state of the data reproduction. Here, the recording apparatus is as shown in FIG. 1. Also, the 
sequences of the data reproduction may be determined differently. For instance, a parental 
level control may be performed during the data reproduction, that is, the sequences of the data 
reproduction may be determined depending on whether the user is an adult or a juvenile. 

[0067] To this end, the reproducing apparatus 200 may be manufactured as show in FIG. 18. 
Referring to FIG. 18, a playback engine denotes a block to process the data recorded on the 
first and second layers. If the storage medium does not contain the data in the third layer, a 
presentation and navigation engine of FIG. 18 converts the user input into an Application 
Program Interface (API), which can be recognized by the playback engine, and provides the API 
to the playback engine. If the user input is key input and recognized by the playback engine, 
the user input is directly transmitted to the playback engine. 
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[0068] The data recorded on the third layer includes presentation data, which is used to 
arrange the menu screen or the reproduction unit in the screen, and the navigation data, which 
is used to select the reproduction unit in response to the user input or the control data 
reproduction, according to a state of a certain playback engine. The presentation data, 
according to an aspect of the present invention, is described using HyperText Markup Language 
(html) or Extensible HyperText Markup Language (xhtml). Also, the navigation data may be 
described with a Script language or a markup language capable of describing timing and 
synchronizing. A typical example of the Script language is Java script that is interpreted and 
executed in units of lines, and a typical example of the markup language having timing and sync 
definitions is synchronized markup interface language (SMIL). 

[0069] The navigation engine performs navigation by controlling the reproduction unit 
recorded on the second layer, according to the user's selection or an event generated by the 
playback engine. 

[0070] The following describes methods of laying out reproduction units, recorded on the 
second layer, in a screen using XHTML and JavaScript, and controlling navigation, according to 
an aspect of the present invention. 

[0071] An example of controlling navigation with a markup document, which allows the user 
input key to be directly transmitted to the playback engine, is as follows: 

- Markup Document 1 

<?xml version="1.0" encoding="UTF-8"?> 
<!DOCTYPE html 

PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"> 
<html xmlns="http://www. w3.org/1999/xhtml" xml:lang="en" lang="en"> 
<head> 

<title>White Snow</title> 
<meta name="Default_audio" content="0x80" /> 
<meta name="Default_subtitle" content="0x20" /> 
<script type="text/javascript"> 
function RcKeyEventHandler(evt) 

{ 

if(evt.keyCode == 10) 
MDLvideo.lnputRCKey(10); 
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if(evt.keyCode ==11) 

MDLvideo.lnputRCKey(11); 
if(evt.keyCode == 12) 

MDLvideo.lnputRCKey(12); 
if(evt.keyCode == 13) 

MDLvideo.lnputRCKey(1 3); 
if(evt.keyCode == 14) 

MDLvideo.lnputRCKey(14); 
if(evt.keyCode == 15) 

MDLvideo.lnputRCKey(1 5); 
if(evt.keyCode == 16) 

MDLvideo.lnputRCKey(16); 
if(evt.keyCode == 17) 

MDLvideo.lnputRCKey(17); 
if(evt.keyCode == 18) 

MDLvideo.lnputRCKey(18); 

} 

</script> 

<script type- 'text/javascript"> 
function setupEventHandler() 
{ 

bodyNode.addEventListener("rckeypress",RcKeyEventHandler,true); 
} 

</script> 
</head> 

<body id="bodyNode" onload="setupEventHandler()"> 
<object data="MDLvideo;" /> 
</body> 
</html> 

[0072] A key event interface of a user input device is as follows: 
interface RCKeyEvent: UlEvent { 
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const unsigned long VK_0 
const unsigned long VK_1 
const unsigned long VK_2 
const unsigned long VK_3 
const unsigned long VK_4 
const unsigned long VK_5 
const unsigned long VK_6 
const unsigned long VK_7 
const unsigned long VK_8 
const unsigned iong VK_9 
const unsigned long VK_PLAY_PAUSE 
const unsigned long VK_STOP 
const unsigned long VK_FF 
const unsigned long VK_FR 
const unsigned long VK_SKIP_PREV 
const unsigned long VK_SKIPJ\IEXT 
const unsigned long VK_SUBTITLE 
const unsigned long VK_AUDIO 
const unsigned long VKJVIENU 
const unsigned long VKJJP 
const unsigned long VKJ_EFT 
const unsigned long VK_RIGHT 
const unsigned long VKJDOWN 
const unsigned long VK_OK 
const unsigned long VK_RETURN 
const unsigned long VK_EXIT 
readonly attribute unsigned long keyCode; 
readonly attribute DOMString keyName; 
void initRCKeyEvent (in DOMString typeArg, 
in boolean canBubbleArg, 
in boolean cancelableArg, 
in unsigned long keyCode, 
in unsigned long keyName); 
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0; // keyCode is "0", keyName is "0" 

1;//"1" 

2; // "2" 

3; // "3" 

4; // "4" 

5; // "5" 

6; // "6" 

7; // "7" 

8; // "8" 

9; // ;: 9 !! 

10;//"PlayPause" 

= 11; //"Stop" 
12;//"FastForward" 
13;//"FastRewind" 
14;//"Prev" 

= 15; //"Next" 
16; //"Subtitle" 

= 17; //"Audio" 
18; //"Menu" 
19; //"Up" 

= 20; // "Left" 

= 21; //"Right" 

= 22; // "Down" 
23; // "OK" 
24; // "Return" 

= 25; //"Exit" 
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[0073] The above examples of the markup document reveals that "Markup Document 1" 
primarily includes layout information, which is related to a video object displayed by the markup 
document, and a script, which is used to control the reproduction data recorded on the second 
layer in response to the user input. Also, the key event interface of the user input device 
defines an interface that allows key values, used in the user input device, to be used in a 
document object model (DOM). 

[0074] First, the above example of the markup document includes sequentially a declaration 
that enables the use of the XHTML and an element <head>, in which the element <title>, the 
element <meta>, and the element <script> are included. In general, the element <title> 
represents the title subject of the markup document, and the element <meta> indicates default 
audio and subtitles in the video object, which is reproduced in the markup document. 

[0075] Also, event registration information according to the user input is described using 
JavaScript language, as follows: 

<script type="text/javascript ,, > 
function setupEventHandler() 
{ 

bodyNode.addEventListener("rckeypress M ,RcKeyEventHandler,true); 
} 

</script> 

[0076] The event registration information indicates that a function RcKeyEventHandler is 
called when an event rckeypress occurs, i.e., the user presses a key of the user input device 
400. 

[0077] Also, event processing information is described using the JavaScript language as 
follows: 

<script type="text/javascript"> 
function RcKeyEventHandler(evt) 

{ 

if(evt.keyCode == 10) 

MDLvideo.lnputRCKey(10); 
if(evt.keyCode ==11) 

MDLvideo. lnputRCKey(1 1 ); 
if(evt.keyCode == 12) 
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MDLvideo. lnputRCKey(1 2); 
if(evt.keyCode == 13) 

MDLvideo.lnputRCKey(13); 
if(evt.keyCode == 14) 

MDLvideo.lnputRCKey(14); 
if(evt.keyCode == 15) 

MDLvideo.!nputRCKey(1 5); 
if(evt.keyCode == 16) 

MDLvideo. lnputRCKey(1 6); 
if (evt. key Code == 17) 

MDLvideo.lnputRCKey(17); 
if(evt.keyCode == 18) 

MDLvideo.lnputRCKey(18); 

} 

</script> 

[0078] The event processing information indicates that MDLvideo.lnputRCKey(IO) is 
executed when a key code RcKeyEventHandler is 10. Here, the object MDLvideo performs 
data reproduction using the additional data stored in the second layer, that is, the object 
MDLvideo corresponds to the playback engine. In other words, if a code allocated to a key is 10 
and the user presses the key, the presentation and navigation engine of FIG. 18 sends a control 
command to the playback engine using a command API MDLvideo. InputRCKey. In response to 
the control command, the playback engine performs the reproduction control operation allocated 
to the key code 10, i.e., reproduces or temporarily stops the reproduction of a video object. 

[0079] The object MDLvideo is embedded in the element <body> of the markup document 
using the element <object>. Meanwhile, it is possible to embed a plurality of objects in the 
element <body> of the markup document. In this case, a layout of the markup document may 
use cascading style sheet (CSS). 

[0080] Next, a method of providing a specific functional command to the playback engine and 
performing navigation will be described. FIG. 19 illustrates a method of forming the menu 
screen for the navigation. 

[0081] Image and text data are described as presentation data to display on the screen. In 
accordance with an aspect of the present invention, the screen may include the text data 
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described with XHTML, or image data recorded on the first layer. Referring to FIG. 19, four 
buttons, i.e., title 1, title 2, title 3, and return, are displayed on the screen. The image or the text 
data forming the four buttons is described using XHTML. In the screen, the user can select and 
press one of the buttons. More specifically, the user applies directional keys of the user input 
device to select one of the buttons and applies an OK key to press the selected button. 
Alternatively, the screen may be constructed such that a certain operation may be executed 
when the user presses a certain button using the keys of the user input device as access keys. 
When the user selects one of the buttons, the selected button is highlighted. The above 
functions are provided by a presentation module of the presentation and navigation engine. In 
accordance with an aspect of the present invention, the presentation module may be an XHTML 
browser. If the user presses one button, the operation connected to the button is performed. 

[0082] Likewise, a command may be provided to the playback engine to reproduce 
multimedia data using the data recorded on the first and second layers. That is, the 
presentation and navigation engine may provide the playback engine with the command for 
controlling the reproduction unit recorded on the second layer. 

- Markup Document 2 

<?xml version="1.0" encoding="UTF-8"?> 
<!DOCTYPE html 

PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"> 
<html xmlns="http://www. w3.org/1999/xhtml" xml:lang="en" lang="en"> 
<head> 

<title>White Snow</title> 

<meta name="Default_audio" content="0x80" /> 

<meta name= ,, Default_subtitle" content="0x20" /> 

</head> 

<body> 

<table> 
<tr> 

<td><img src="file1.jpg M alt= n title17></td> 

<td><button type="button M name="title 1" onclick="MDLvideo.playTitle(1)7></td> 

</tr> 

<tr> 
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<td><img src="file2.jpg H alt="title27></td> 

<td><button type="button" name="title 2" onclick="MDLvideo.playTitle(2) , 7></td> 
</tr> 
<tr> 

<td><img src="file3.jpg" alt="title3 , 7></td> 

<td><button type="button" name="title 3" onclick="MDLvideo.playTitle(3)7></td> 
</tr> 
<tr> 

<td><button type="button n name^'Return" onclick="MDLvideo.play()7></td> 
</tr> 
</body> 
</html> 

[0083] The Markup Document 2 represents the menu screen illustrated in FIG. 19. Referring 
to the Markup Document 2, commands to control the reproduction units recorded on the second 
layer are described in an 'onclick' event of each button. When a button to reproduce title 1 is 
clicked, a command MDLvideo.playTitle(l) is sent to the navigation engine. Then, the 
navigation engine provides the commands and parameters to the playback engine for 
reproduction of corresponding data. For instance, playTitle is a title reproduction parameter that 
indicates title number. A function to provide such a command is called a method. 

[0084] Examples of the method are as follows: 

method for an object MDLvideo { 

playTitle(title_no): reproduce data starting from a title indicated in title_no. 

playTime(time): reproduce data starting from indicated time. 

pause(): pause data reproduction 

stop(): stop data reproduction 
play(): start data reproduction 

forward_scan(speed): high-speed forward data reproduction at a speed indicated in 
(speed) 

backward_scan(speed): high-speed backward data reproduction at a speed indicated in 
(speed) 

playChapter(chapter_no): when chapter_no == 0, move to chapter having the largest 
chapter ID and reproduce data of the chapter. If chapterjio > Maximum chapter ID, move to a 
chapter having the smallest chapter ID and reproduce data of the chapter 
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> 

[0085] If specific conditions required by the playback engine are satisfied, an event to 
process the data recorded on the third layer is generated in the presentation and navigation 
engine. For instance, whenever a chapter begins, the event is generated and provided to the 
presentation and navigation engine, and then, the presentation engine displays a start of the 
chapter in the screen. The information regarding the event that is provided to the presentation 
and navigation engine and registered to the playback engine may also be recorded on the third 
layer. 

_ Marknn nnrnmpnt 3 
...w»...~ r — ~ • - — 

<smil> 
<head> 

<meta name="title" content="White Snow7> 
<meta name="Default_audio" content="0x80 M /> 
<meta name= H Default_subtitle" content="0x20" /> 
<layout> 

<root-layout width="1920 M height= M 10807> 

<region id= ,, img_scene1" top="45" left="30" width="600 M height="3007> 
<region id="img_scene2" top="390" left="30" width= M 600" height="3007> 
<region id="img_scene3" top="735 M left="30" width="600" height="3007> 
<region id="imgJitle_button1" top="95" left="660" width="600" height="2007> 
<region id="img Jitle_button2" top="440" left="660" width= M 600" height="2007> 
<region id= ,, img_title_button3" top= , 785" left="660" width= M 600" height="2007> 
<region id="img_return n top="885" left="1290 M width="600" height= H 1507> 
</layout> 

</head> 

<body> 
<par> 

<img src="file1.jpg" region="img_scene17> 
<img src="file2.jpg M region= M img_scene27> 
<img src="file3.jpg" region="img_scene37> 

<img src="file4.jpg?url=command:MDLvideo.playTitle(1 )" region="imgJitle_button17> 
<img src="file5.jpg?url=command:MDLvideo.playTitle(2) M region="img_title_button27> 
<img src="file6.jpg?url=command:MDLvideo.playTitle(3)" region="img_title_button37> 



25 



1293.1723 

<img src="file7.jpg?url=command:MDLvideo.play()" region="img_return7> 
</par> 
</body> 
</smil> 

[0086] The Markup Document 3 represents the data that is stored in the third layer and 
described using SMIL, which is the markup language with timing and synchronization functions. 
SMIL is also largely divided into the elements <head> and <body>. In the Markup Document 3, 
the element "head" includes elements "meta" and "layout". The element "meta" has already 
been described with respect to the above markup documents and its description will be omitted 
here. The element "layout" has children elements "root-layout" and "region". The element 
"root-layout" describes a size and a background color of a document to be displayed. The 
element "region" describes the layout of a SMIL document region where each media clip is 
formed, and an ID of the respective region. 

[0087] The element "body" includes the following elements, which describes media clip 
sources: 

<animation /> - Shockwave Flash File (.swf) 

<audio /> - Audio Clip File (.rm) 

<img /> - Image Clip File (jpg.gif) / Not Use Animation GIF 

<ref /> - Document of Different Use (.htm.rp ) 

<text /> - Text (.bet) 

<textstream..../> - Text Stream (.rt) 
<video /> - Video Clip (.rm) 

[0088] In the Markup Document 3, the element "img" is used to create the menu screen 
shown in FIG. 19. The element "img" can be used to link a document to a desired image or 
insert a command into the document. Also, Markup Document 3 may further include elements 
"a" and "anchor for use as a hyperlink. In Markup Documents, all images are linked except 
three images. In SMIL, a reproduction control attribute "url" in the element "img" may have the 
following attribute values, including playTitle: 

url :: hyperlink URL. 

command:MDLvideo.playTitle(titlejio): reproduce data starting from a title indicated in titlejio. 
command:MDLvideo.playTime(time): reproduce data starting from time indicated in (time). 
command:MDLvideo.play(): start data reproduction. 
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command:MDLvideo.stop(): stop data reproduction. 
command:MDLvideo.pause(): pause data reproduction. 

command:MDLvideo.forward_scan(speed): high-speed forward reproduction at a speed 
specified in (speed). 

command:MDLvideo.backward_scan(speed): high-speed backward reproduction at a speed 
specified in (speed). 

command:MDLvideo.playChapter(chapter__no): if chapter_no == 0, move to a chapter having 
the largest chapter ID and reproduce data of the chapter. 

If chapter_no > Maximum chapter ID, move to a chapter having the smallest chapter ID and 
reproduce data of the chapter. 

[0089] As described above, according to a storage medium of the present invention, 
information regarding reproduction of multimedia data is divided into record units and 
reproduction units, and additional information regarding the record units and reproduction units 
are recorded on a second layer. Also, information regarding selection and navigation of a 
desired reproduction unit is recorded on a third layer. In this way, the operation of each layer 
can be distinguished. In the second layer, the data is recorded using a markup language to 
improve extensibility. In the third layer, the data is also recorded using the markup language to 
create additional data, representing a menu screen or the layout of a reproduction unit. The 
data regarding selection of the reproduction unit and the reproduction sequence is described 
using a script language or the markup language with timing and synchronization functions. 

[0090] In order to store the multimedia data in the storage medium, a multimedia data 
recording apparatus, according to an aspect of the present invention, records the multimedia 
data on the first layer of the storage medium, divides the additional information of the 
multimedia recorded on the first layer into the record units and the reproduction units, and 
records the divided additional information on the second layer in a table format or by using the 
markup language. Here, the record units and the reproduction units may have multiple 
hierarchical structures. According to an aspect of the present invention, the record unit is a clip 
that is made by linking time information to position information when video object data is 
recorded at VBR, and the reproduction units are cells linked to the clip or a portion of the clip, 
chapters linked to a plurality of cells, and titles linked to a plurality of chapters. 

[0091] In addition, navigation data regarding selection of the reproduction unit or the 
reproduction sequence is recorded on the third layer. The navigation data is described using 
either the script language executed in units of lines, or the markup language with timing and 
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synchronization functions. Also, in the third layer, presentation data, representing a menu 
screen to select the reproduction unit or the reproduction sequence, is described using the 
markup language. 

[0092] A storage medium, according to an aspect of the present invention, includes the 
multimedia data and the additional data. The additional data includes two different types of 
data: the additional information, which includes the record unit, attributes, and the reproduction 
unit of the multimedia data, and the navigation information, which relates to a selection of the 
reproduction unit and the reproduction sequence. The additional information is described using 
the markup language, thereby enabling an addition of new multimedia data regardless of an 
extension of a standard. Also, even if a new record unit or reproduction is prescribed, it is easy 
to support implementation of the extension. Further, both or one of the markup language and 
the script language is used to describe navigation data, which represents selection of the 
reproduction unit or the reproduction sequence. The markup language is also used to describe 
presentation data, which represents the menu screen to select the reproduction unit and a 
screen layout for the data reproduction, thereby enabling a menu structure and navigation with a 
high degree of flexibility. 

[0093] While this invention has been particularly shown and described with reference to 
[preferred embodiments] aspects thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the spirit and 
scope of the invention as defined by the appended claims. 
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